Trulia is seeking a database software engineer to join its data warehousing, analytics, CRM and reporting team. The successful candidate has experience building and maintaining data-intensive applications to support marketing, analytics and CRM efforts, as well as designing, planning and implementing robust data feeds with high accuracy and short turnaround times.
To be succesful in this role, you should be familiar with the following technologies: • automated data collection and crunching techniques (shell, MySQL, Java, python or php programming) • data aggregation, manipulation, and analysis ("ETL" processes) • statistics and summary data validation • automated QA • data warehousing • publishing reports to Web servers or other delivery systems • end-user software such as Microsoft Excel, its capabilities and its limitations when it comes to handling large data sets
Responsibilities
Include: • interfacing with engineers and analysts to determine data collection needs • designing, planning, documenting and executing data queries (database queries, etc) • working with the team to determine which data sets are one-offs and which deserve a more long-term investment, and act on those determinations (stored procedures, views, custom reports, etc) • creating and maintaining ETL processes for the sales, marketing and operations teams • pulling ad-hoc data and reports as needed
Requirements
• 4+ years' experience in software development with strong software engineering skills (python or php, Java) • 4+ years' experience working with large data sets and MySQL 4 or 5 • Solid practical knowledge of database technologies (normalization, storage engines, data partitioning, indexes, query analysis and optimization, dimensions) • Solid knowledge of the unix/linux environment and standard tools • Excellent communication skills • Ability to work effectively under time pressure and be comfortable working with cross-functional teams of data users with varying degrees of technical knowledge
Nice to have: • Experience with map-reduce technology (Hadoop) • Experience with other RDBMS systems (PostgreSQL, Oracle, etc) • Experience working with OLAP tools such as Business Objects • Experience interfacing with third-party data stores (e.g. Salesforce, SugarCRM) via various APIs and API styles (SOAP, REST, etc)